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ABSTRACT - As the current speech enhancement 
algorithms can give results for improved speech 
audibility only. So, our main motivation is to 
develop speech enhancement algorithm that would 
improve performance in speech for hearing 
impaired persons. In this paper, an algorithm is 
presented for improving speech intelligibility. 
Various speech enhancement algorithms were 
developed but only some of them can be used for 
real time hearing aid applications. This proposed 
algorithm can be used for practical hearing 
prosthetic devices. Implementation of the binary 
masking algorithm uses a bank of band-pass filters 
to perform mapping of signals. Also, classification is 
performed with a signal-to-noise (SNR) estimate and 
a comparator. This includes spatial filtering method, 
classification of signals such as original and noisy 
signal. After this based on SNR threshold level 
signals are recombined to obtain reduced noise 
level in speech signal. In this, matlab implementation 
of binary mask algorithm is provided which shows 
better results for speech intelligibility as compared 
to other algorithms. 

Keywords — Binary Mask Algorithm (BMA), Signal-to- 
noise ratio (SNR), Speech intelligibility, Speech quality, 
Time-Frquency (T-F), Independent component analysis 
(ICA). 

I. INTRODUCTION 

Hearing impaired listeners find more difficulty in 
distin- guishing speech in noisy environments as 
compared to persons with normal hearing. While 
using correct hearing devices hear- ing impaired 
listeners find it difficult in interpreting speech. The 
existing methods used in available speech algorithms 
gives better results for quality of speech signal. 
While clarity of speech will give poor performance. 
Because of this hearing impaired listeners having 
problems in communication. This speech 
intelligibility problem is significantly removed by 
using binary mask algorithm. This proposed method 
is suitable for hearing aid application. Time 
frequency masking technique is introduced in this 
algorithm. Time frequency masks often take the 
values of ’zero’ and ’one’, resulting in mixture of 
two signals. Binary mask is calculated in 


classification stage and noise signal is removed. 
Second, methods used for calculating masks are 
based on computational auditory scene analysis [15] 

II. COMPARISON OF PROPOSED M 
ETHOD WITH EXISTING METHODS 

The existing methods used in hearing aids 
improve only speech quality of signals and ease 
of listening. But the speech intelligibility 
improvement for hearing aid application is limited. 
So, it is important to develop speech separation 
algorithms for hearing aid application that [15] will 
increase the intelligibility of the speech more 
effectively. 



Fig. 1 . Binary Mask Algorithm for enhancing speech 
intelligibility. 

The basic technique is the time frequency 
masking for sound separation. In speech 
enhancement Wiener filter can be used as time 
frequency mask where each mask gives relation 
between original speech signal and mixture of 
original and noisy signals. Wang and Brown [1] 
presented a monaural time-frequency masking 
algorithm. In this single microphone is used for the 
speech processing. Monaural algorithms uses sound 
properties for speech signal separation or 
auditory scene analysis. There are two types of 
algorithms: feature- based and model based. In 
both these algorithms binary or ratio (soft) 
mask is produced. Time frequency masking 
technique is based on two aspects: auditory 
masking and sparseness in the signals in T-F 
domain. P. Scalart and Filho [2] presented Wiener 
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algorithm depends on signal-to-noise ratio 
calculation and two spectral-subtractive algorithms 
based on reduced delay convolution [3], In this 
algorithm, for sentence segmentation overlapping 
method is used with 160 samples and 50 % 
overlap is obtained. Hann window technique and 
discrete Fourier transform (DFT) is used for each 
segment. Performance of subtractive algorithms 
[7], the performance of Wiener algorithm is 
substantially higher. So, in hearing aid prosthetics 
Wiener algorithm is preferred than spectral- 
subtractive algorithm. J. Li, S. Sakamoto, S. Hongo, 
M. Akagi and Y. Suzuki [4] presented binaural 
algorithm for speech im-provement. The binaural 
algorithm is based on both CASA and ICA analysis 
and uses two or more microphones for speech 
separation. These algorithms only improves speech 
quality but no significant results are obtained for 
speech intelligibility. The proposed algorithm i.e. 
binary mask algorithm is suitable for real-time 
implementation of hearing prosthetics. V. Hanson 
[5] presented a binary mask algorithm that has a 
large potential for improving speech performance 
in background noise. In this algorithm, 
performance is effective even if the noise is not 
sparse in the time-frequency region. The whole 
SNR of the speech signal can be improved by 
discarding those regions of the time-frequency 
domain whose SNR is fails to go beyond a definite 
threshold level. 

III. SYSTEM IMPLEMENTATION 

Time-frequency masking is a widely used 
technique for speech and signal processing used in 
automatic speech recog- nition [5]. As shown in the 
lock diagram, mixture of two signals i.e. original 
speech and signal with noise acts as input for next 
stage. Mapping of speech from time to time 
frequency region is done. After that binary mask is 
created and with the help of SNR thresholding 
stage noise will be removed. Finally clean speech 
signal is obtained. 


A. Auditory Masking 

As time frequency masking technique is based 
on scarcity of speech signals in T-F domain. 
Because of this, overlap- ping of noise with T-F 
domain region containing original speech signal is 
very difficult. Background noise which is very 
close to speech containing T-F regions that lowers 
the speech perceptibility by the psychoacoustic 
phenomenon i.e. auditory masking. Performance 
of speech intelligibility de- creases because of 
auditory masking even in normal SNR levels. 
Hearing impaired listeners find it more difficult 
due this auditory masking and loss of speech 
intelligibility. But those with normal hearing can 
handle this effect. 

B. Improving Speech Intelligibility 

Background noise is removed effectively by 
using binary mask algorithm. Noisy signals in the 
time frequency region masks original speech 
signal. Also it reduces speech clarity of signal. In 
this algorithm, in first step input signal in time 
domain is mapped T-F domain. The next step 
creates the 

binary mask by separating original speech signals 
from noisy signals. After this thresholding is done, 
recombined all the originals signal by removing 
noisy regions. 

C. Binary Mask Algorithm Challenges 

Spectral resolution is directly dependent on 
performance of binary mask algorithm. In spatial 
filtering stage, operations are performed 
sequentially but it requires higher clock rates. The 
chain of sequentially executed instructions are 
pipelined together. Memory requirement and 
hardware distributions will be reduced with proper 
selection of filter structure. 

IV. EXPERIMENTAL ANALYSIS 
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Fig. 2. Block diagram of binary mask algorithm. 


A. Steps for Binary Mask Algorithm 

1) Setting SNR: Noisy speech signals were 
generated in Matlab using a weighted sum of 
clean speech and noise. SNR levels were 
adjusted by changing the weight applied to the 
noise signal. SNR was measured using the 
global SNR output from the snrseg function in 
the Voice-box toolbox. 
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2) Audio record function : By using this function 
various speech samples are recorded with 
different background noise. These speech 
samples are real-time signals as it includes 
the target speech and background noise. The 
noise will be removed after applying binary 
mask algorithm. 

3) Speech samples are recorded using single 
microphone only. Variables are initialised in the 
algorithm. 

4) Allocate memory for various variables and 
noise mag- nitude is calculated. 

5) Noise estimation is done for different 
parameters and divide real-time speech samples 
into frames. 

6) Perform the short time Fourier transform 
(STFT) anal- ysis and compute true STFT noise 
spectrum. 

7) Apply binary mask algorithm and create 
modified spec- trum and perform sanity check 
and reconstruct clean speech. 

8) Apply inverse STFT and truncate the extra 
paddings used i.e. back to original signal 
length. 

B. Implementation of Binary Mask algorithm 

In this, speech samples are read from the audio 
database. Then the samples are stored in new 
variable in Matlab. Next step is to create noisy 
speech signal, for this noise is added to the clean 
speech signal(original speech signal). Create time 
vector for vector to frame conversion and vice 
versa. After this binary mask algorithm is applied to 
it. Output of the binary mask algorithm is speech 
samples which is written to the wave file for further 
playback. After listening the processed speech 
samples and noisy speech samples, conclusions are 
made that speech samples are enhanced effectively 


and background noise is removed significantly. 



Fig. 3. Flow chart for Binary Mask algorithm. 

V. RESULTS AND DISCUSSION 
A. Simulation Results 

The target signals used for testing the algorithm 
are taken from real-time recording. Also the target 
signals can be ob- tained from the TIMIT Acoustic 
Phonetic Continuous Speech Corpus. The specific 
noise chosen include babble (speech shaped 
noise), white noise and restaurant noise. Speech 
sim- ulation is done in Matlab. SNR values can be 
adjusted for various speech samples. SNR can be 
measured using global SNR output from snrseg 
function in voicebox toolbox MAT- LAB. 

B. Algorithm demo using GUI 

This demo provides a simple GUI for basic 
filtering of audio data. Using the GUI having 
advantages as following: 1. Load audio data stored 
in a .wav file. 2. Real-Time speech signals are 
recorded using binary mask algorithm. In this GUI, 
speech can be recorded or it can be loaded using 
stored database files. Add the filtered noised to the 
original audio signal. 4. Remove the noise by 
inverse filtering. 5. The purpose of this DEMO is 
not to provide robust and sophisticated denoising 
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algorithms, but simply to demonstrate some basic 
audio filtering processes in Matlab. In the above 
figure a screen shot of the demo is shown. In the 
first panel (row) you can see the plotted audio 
signal and the respective spectrogram. In the 
second panel 
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Fig. 4. Binary Mask Algorithm demo using GUI. 


binary mask output waveform is shown and along 
with its spectrogram. In this GUI demo, real-time 
speech samples can be obtained by real-time 
recording. Also speech samples can be saved and 
can be used for future use. In the third panel, Power 
Spectral Density (PSD) for both real-time speech 
signal and binary mask output signal is shown 
respectively. 



Timed) 


Fig. 5. Unprocessed and processed waveforms for sample 1. 

In figure 7, the graph shows the percentage of 
improvement for binary mask algorithm over 
different speech enhancement algorithms. The 
speech samples are recorded real-time and for 
comparison 10 samples are taken. 

VI. CONCLUSION AND FUTURE WORK 


This proposed algorithm helps to enhance the 
clarity of speech signals having noise much 
higher as compared to other algorithms. Binary 
mask algorithm improves both speech quality as 
well as speech intelligibility. The proposed method 
is characterized by the following: 
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Fig. 6. Unprocessed and processed waveforms for sample 

1 implementation of bank of band-pass filters and 
binary mask algorithm improves as the frequency 
span of filter bank increases, which causes speech 
intelligibility to increase and noise sensitivity to 
decrease.. 



Fig. 7. Improvement of Binary Mask algorithm over other 
methods. 

Filter bank is implemented in frequency range of 
200 Hz to 10 kHz. 

2) Filters are designed using Matlabs filter design 
tool. 

3) For calculating binary mask, target+noise 
mixture and noise signals are performed in parallel 
using two separate banks of 

28 band-pass filters. 

4) Binary mask signal is created in clas- sification 
stage. Classification calculations are pipelined with 
spectral stage calculations to reduce the total 
processing time. 


ISSN - 2394-0573 


All Rights Reserved © 2016 IJEETE 


Page 129 








International Journal of Exploring Emerging Trends in Engineering (IJEETE) 
Vol. 03, Issue 02, Mar-Apr, 2016 Pg. 126 - 131 WWW.IJEETE.COM 


5) Target signals and noise signals are taken from 
databases. Future work can improve the proposed 
method by realizing on a low power DSP or in a 
low power ASIC. Binary mask algorithm can give 
give better results for speech intelligibility in low 
frequency range. 
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